Super-Wideband Bandwidth Extension for Wideband Audio Codecs Using Switched Spectral Replication and Pitch Synthesis

نویسندگان

  • Bernd Geiser
  • Hauke Krüger
  • Peter Vary
چکیده

This paper describes a new bandwidth extension algorithm which is targeted at high quality audio communication over IP networks. The algorithm is part of the Huawei/ETRI candidate for the ITU-T super-wideband (SWB) extensions of Rec. G.729.1 and G.718. In the SWB candidate codec, the 7-14 kHz frequency band of speech and audio signals is represented in terms of temporal and spectral envelopes. This description is encoded and transmitted to the decoder. In addition, the fine structure of the input signal is analyzed and compactly encoded. From this compact information, the decoder can regenerate the 7-14 kHz fine structure either by spectral replication or by pitch synthesis. Then, an adaptive envelope restoration procedure is employed. The algorithm operates in the MDCT domain to allow subsequent refinement coding by vector quantization of spectral coefficients. In the paper, relevant listening test results for the G.729.1SWB candidate codec that have been obtained during the ITU-T standardization process are summarized. Good audio quality could be shown for both speech and music signals.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Audio bandwidth extension using ensemble of recurrent neural networks

In audio communication systems, the perceptual audio quality of the reproduced audio signals such as the naturalness of the sound is limited by the available audio bandwidth. In this paper, a wideband to super-wideband audio bandwidth extension method is proposed using an ensemble of recurrent neural networks. The feature space of wideband audio is firstly divided into different regions through...

متن کامل

Artificial Bandwidth Extension of Wideband Speech by Pitch-Scaling of Higher Frequencies

In this paper, a simple DFT-domain pitch-scaling technique is used to extend the audio bandwidth of wideband speech (50Hz – 7 kHz) to the super-wideband range (50Hz – 12 kHz). Therefore, the higher frequencies of the wideband signal (6 – 7 kHz) are pitch-scaled with a scaling factor of four and the resulting, scaled signal is inserted into the 8 – 12 kHz band. A subjective listening test has be...

متن کامل

Bandwidth Extension of Speech Signals: A Comprehensive Review

Telephone systems commonly transmit narrowband (NB) speech with an audio bandwidth limited to the traditional telephone band of 300-3400 Hz. To improve the quality and intelligibility of speech degraded by narrow bandwidth, researchers have tried to standardize the telephonic networks by introducing wideband (50-7000 Hz) speech codecs. Wideband (WB) speech transmission requires the transmission...

متن کامل

From Narrowband Telephony to Wideband Telephony

The restricted audio quality of today’s telephone networks is mainly due to the narrowband (NB) limitation to the frequency range from about 300 Hz to 3.4 kHz. Meanwhile, codecs for wideband (WB) telephony (50 Hz to 7 kHz) exist with significantly improved speech intelligibility and naturalness. However, the broad introduction of wideband speech coding will require strong efforts of both networ...

متن کامل

Subjective voice quality evaluation of artificial bandwidth extension: comparing different audio bandwidths and speech codecs

Artificial bandwidth extension (ABE) methods have been developed to improve the quality and intelligibility of telephone speech. In many previous studies, however, the evaluation of ABE has not fully reflected the use of ABE in mobile communication (e.g., evaluation with clean speech without coding). In this study, the subjective quality of ABE was evaluated with absolute category rating (ACR) ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010